Using Subgroup Discovery to Analyze the UK Traffic Data
نویسندگان
چکیده
Rule learning is typically used in solving classification and prediction tasks. However, learning of classification rules can be adapted also to subgroup discovery. Such an adaptation has already been done for the CN2 rule learning algorithm. In previous work this new algorithm, called CN2-SD, has been described in detail and applied to the well known UCI data sets. This paper summarizes the modifications needed for the adaptation of the CN2 rule learner to subgroup discovery and presents its application to a real-life data set the UK traffic data confirming its appropriateness for subgroup discovery in real-life applications through experimental comparison with the CN2 rule learning algorithm as well as through the evaluation of an expert. Furthermore we make the first step towards the comparison of the new CN2-SD algorithm to another state-of-the-art subgroup discovery algorithm SubgroupMiner by applying both algorithms to a slightly different data set the UK traffic challenge data set. The results of this application are presented in the form of ROC curves, showing CN2-SD’s potential in finding descriptions (subgroups) for minority classes, while SubgroupMiner found ‘better’ subgroups when trying to describe the majority class given the problem at hand.
منابع مشابه
ROC Analysis of Example Weighting in Subgroup Discovery
This paper presents two new ways of example weighting for subgroup discovery. The proposed example weighting schemes are applicable to any subgroup discovery algorithm that uses the weighted covering approach to discover interesting subgroups in data. To show the implications that the new example weighting schemes have on subgroup discovery, they were implemented in the APRIORI-SD algorithm. RO...
متن کاملAnalysis of Example Weighting in Subgroup Discovery by Comparison of Three Algorithms on a Real-life Data Set
This paper investigates the implications of example weighting in subgroup discovery by comparing three state-of-the-art subgroup discovery algorithms, APRIORI-SD, CN2-SD, and SubgroupMiner on a real-life data set. While both APRIORI-SD and CN2-SD use example weighting in the process of subgroup discovery, SubgroupMiner does not. Moreover, APRIORI-SD uses example weighting in the post-processing...
متن کاملCluster Based Cross Layer Intelligent Service Discovery for Mobile Ad-Hoc Networks
The ability to discover services in Mobile Ad hoc Network (MANET) is a major prerequisite. Cluster basedcross layer intelligent service discovery for MANET (CBISD) is cluster based architecture, caching ofsemantic details of services and intelligent forwarding using network layer mechanisms. The cluster basedarchitecture using semantic knowledge provides scalability and accuracy. Also, the mini...
متن کاملAPRIORI-SD: Adapting Association Rule Learning to Subgroup Discovery
& This paper presents a subgroup discovery algorithm APRIORI-SD, developed by adapting association rule learning to subgroup discovery. The paper contributes to subgroup discovery, to a better understanding of the weighted covering algorithm, and the properties of the weighted relative accuracy heuristic by analyzing their performance in the ROC space. An experimental comparison with rule learn...
متن کاملA Systematic Method to Analyze Transport Networks: Considering Traffic Shifts
Current network modeling practices usually assess the network performance at specified time interval, i.e. every 5 or 10 years time horizon. Furthermore, they are usually based on partially predictable data, which are being generated through various stochastic procedures. In this research, a new quantitative based methodology which combines combinatorial optimization modeling and transportation...
متن کامل